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ABSTRACT 

Nucleosome positioning on the chromatin strand 
plays a critical role in regulating accessibility of 
DNA to transcription factors and chromatin modify- 
ing enzymes. Hence, detailed information on nu- 
cleosome depletion or movement at c/s-acting 
regulatory elements has the potential to identify pre- 
dicted binding sites for frans-acting factors. Using a 
novel method based on enrichment of mononu- 
cleosomal DNA by bacterial artificial chromosome 
hybridization, we mapped nucleosome positions by 
deep sequencing across 250 kb, encompassing the 
cystic fibrosis transmembrane conductance regula- 
tor [CFTR) gene. CFTR shows tight tissue-specific 
regulation of expression, which is largely deter- 
mined by c/s-regulatory elements that lie outside 
the gene promoter. Although multiple elements are 
known, the repertoire of transcription factors that 
interact with these sites to activate or repress 
CFTR expression remains incomplete. Here, we 
show that specific nucleosome depletion corres- 
ponds to well-characterized binding sites for 
known frans-acting factors, including hepatocyte 
nuclear factor 1, Forkhead box A1 and CCCTC- 
binding factor. Moreover, the cell-type selective 
nucleosome positioning is effective in predicting 
binding sites for novel interacting factors, such 
as BAF155. Finally, we identify transcription fac- 
tor binding sites that are overrepresented in 
regions where nucleosomes are depleted in a 
cell-specific manner. This approach recognizes the 



glucocorticoid receptor as a novel frans-acting 
factor that regulates CFTR expression in vivo. 

INTRODUCTION 

Recent progress in generating maps of open chromatin 
genome wide (1,2) has greatly accelerated the discovery 
of regulatory mechanisms both for individual genes and 
for coordinated transcriptional networks. However, 
advances in the technologies that enable de novo charac- 
terization of trans-acting factors that interact with these 
regions of open chromatin are still in development (3-5). 
Until recently, prediction of transcription factor binding 
sites in genomic DNA was largely dependent on the use of 
in silico approaches followed by in vivo verification by 
chromatin immunoprecipitation (ChlP) using specific 
antibodies for individual factors. As the regions of open 
chromatin identified by DNase-seq (DNase I hypersensi- 
tivity mapping followed by deep sequencing) and FA1RE 
(formaldehyde-assisted identification of regulatory 
elements) can encompass > 1 kb of DNA sequence, it 
remains a challenge to characterize the cw-regulatory 
elements that are responsible for the chromatin landscape. 
One approach that can assist in this effort is to fine map 
the location of nucleosomes across the region of open 
chromatin, as nucleosome positioning is known to play 
a critical role in accessibility of the DNA strand to inter- 
acting proteins (6,7). The dynamic competition between 
DNA unwinding, nucleosome positioning and transcrip- 
tion factor binding is not yet fully understood (8,9). 
However, there are many cases, for example, the 
androgen receptor (4,10), where loss of a specific nucleo- 
some can be directly correlated with occupancy of a regu- 
latory element by one or more of its cognate protein 
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partners. Here, we combine the techniques of DNase-seq 
and a novel method to fine map nucleosome positions by 
bacterial artificial chromosome (BAC) enrichment of 
mononucleosomal DNA and deep sequencing, to identify 
trans-acting factors regulating cystic fibrosis transmem- 
brane conductance regulator (CFTR) gene expression. 
The CFTR gene spans 1 89 kb of genomic DNA at chromo- 
some 7q31 (11), although regulatory elements for the locus 
map to regions far upstream and downstream of the gene, 
in addition to within introns [reviewed in (12)]. Hence, to 
fully understand the CFTR cw-regulatory sequences and 
trans-acting factors that control cell-specific expression of 
the gene, a region of >250 kb must be evaluated. The avail- 
ability of a CFTR BAC clone that includes the majority of 
the critical regions (13) greatly facilitated this analysis. 

CFTR shows tight tissue-specific regulation of expres- 
sion, which is largely determined by m-regulatory 
elements that lie outside the gene promoter (14). We 
recently showed that the active locus exists in a looped con- 
formation that brings distal regulatory elements, located 
within cell-type-specific DNase I hypersensitive sites 
(DHS) into close proximity with the gene promoter (15). 
Moreover, we showed that the regulatory mechanisms in 
airway epithelial cells, which generally express low levels of 
CFTR in comparison with intestinal epithelial cells, achieve 
this through different ra-acting elements (15,16). However, 
in either cell type, the identification of the critical trans- 
acting factors that activate these elements remains incom- 
plete. Several methods have recently been developed to ac- 
curately map transcription factor binding sites including 
direct methods, such as high-resolution in vivo footprinting 
by DNase-seq, (3,4) and indirect methods provided by 
mapping nucleosomes in vivo (17). Here, we use a novel 
method (Yigit et dl., 2012, in review) to search for cell-type- 
-specific depletion of individual nucleosomes at the cores of 
regions of open chromatin that correspond to well- 
characterized regulatory elements associated with DHS in 
the CFTR gene. We find specific loss of nucleosomes that 
correspond to well-characterized binding sites for known 
trans-actvng factors. Next, we identify predicted binding 
sites for factors that have not previously been implicated 
in CFTR regulation, which interact with nucleosome 
depleted regions in vivo and demonstrate occupancy of 
these factors in relevant cell types by ChlP. Finally, we 
use a bioinformatic approach to identify transcription 
factor binding sites that are overrepresented in regions 
where a nucleosome is lost in a cell-specific manner in a 
bronchial epithelial cell line (16HBE14o-) in comparison 
with skin fibroblasts. This approach predicts sites for 
several factors including the glucocorticoid receptor 
(GR), and we show by ChlP that dexamethasone-induced 
occupancy of multiple GR elements (GREs) causes repres- 
sion of CFTR transcription. 

MATERIALS AND METHODS 

Cell culture 

The human colon carcinoma cell lines Caco2 (18) and 
16HBE14o- bronchial epithelial cells (19), were grown 
in Dulbecco's modified Eagle's medium (Invitrogen) 



supplemented with 10% FBS. Primary skin fibroblasts 
(GM08333) were grown in MEM (Invitrogen) supple- 
mented with 15% FBS. NHBE cells, a mixture of 
primary human bronchial and tracheal epithelial cells 
(Lonza, CC-2541), were cultured in BEGM (Lonza) as 
per the manufacturer's instructions. Two independent bio- 
logical samples were analysed, NHBE1 (7F3506) and 
NHBE2 (8F3407). Primary human fetal epididymis cells 
were cultured in CMRL1066 medium with 10% FCS (20). 

For glucocorticoid receptor binding experiments, 
16HBE14o- cells at 100% confluence were treated with 
100 nM dexamethasone (Sigma-Aldrich, St. Louis, MO, 
USA) in ethanol or the same volume of ethanol alone 
for 3h and then harvested for ChlP or RNA extraction. 
CFTR mRNA was assayed by quantitative reverse tran- 
scriptase-polymerase chain reaction (RT-qPCR) as 
described previously (21). 

Isolation of nuclei, MNase digestion and DNA purification 

Cells were harvested with trypsin or non-enzymatic dis- 
sociation solution (NHBE), washed twice in chilled 
phosphate-buffered saline at 4°C and pelleted by centrifu- 
gation at 200 g for 1 min. Cells were then resuspended in 
ice-cold NP-40 lysis buffer [lOmMTris, pH 7.4, 10 mM 
NaCl, 3mM MgCl 2 , lmMCaCl 2 , 0.5% Nonidet P-40, 
0.15 mM spermine, 0.5 mM spermidine supplemented 
with one complete ethylenediaminetetraacetic acid 
(EDTA)-free protease inhibitor cocktail/50 ml] for 5 min 
on ice. Nuclei were pelleted at 19 g for 10 min in a 
pre-cooled swinging bucket rotor at 4°C. Nuclei were 
gently re-suspended in micrococcal nuclease (MNase) di- 
gestion buffer (lOmMTris, pH 7.4, 60mMKCl, 
15mMNaCl, 0.15mM spermine, 0.5mM spermidine sup- 
plemented with one complete EDTA-free protease inhibi- 
tor cocktail/ 50 ml) and pelleted again as described earlier 
in the text. For micrococcal nuclease digestion, nuclei were 
resuspended in 1 ml MNase buffer and then adjusted to 
4 x 10 7 nuclei/ml in MNase buffer, on ice. MNase diges- 
tion was performed at 37°C in the presence of 1 mM CaCl 2 
until the 147-bp (core nucleosome) and 167-bp 
(chromatosome) bands were clearly identifiable. The di- 
gestion was stopped with a final concentration of 
lOmMEDTA and 1% sodium dodecyl sulphate (SDS). 
After the digestion, the nucleosomes were treated with 
RNase A at 37°C for 30 min, and proteins were digested 
by proteinase K treatment. DNA was extracted with 
phenol/chloroform, precipitated with ethanol and resus- 
pended in 1 x TE buffer. 

Gel extraction and purification of mononucleosome DNA 

Digested chromatin DNA was separated on 3% agarose 
gel (NusieveGTG, Lonza) in 1 x TBE buffer until 147- 
and 167-bp DNA bands were clearly identifiable. The 
147-bp DNA band was carefully excised from the 
agarose gel, and it was eluted in crush and soak buffer 
(300 mM sodium acetate, 1 mM EDTA) with gentle over- 
night agitation. The excess agarose was removed by cen- 
trifugation, and supernatant containing nucleosome DNA 
was cleaned up by QIAquick PCR Purification Kit 
(Qiagen). 
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Preparation of SOLiD library 

About 6-7 ug of DNA was end repaired and ligated to the 
forward (PI) and reverse (P2) adaptors according to the 
SOLiD System library preparation protocol using 
reagents from New England Biolabs (NEB). PI adaptors 
had unique 6-bp barcodes at their 3'-ends to distinguish 
nucleosomes from each of the six cell types analysed. 
Ligation products were separated by 5% native polyacryl- 
amide gel, and they were extracted by the crush and soak 
method described earlier in the text. DNA was further 
purified and concentrated by QIAquick PCR purification 
kit and then nick translated by DNA polymerase I (NEB) 
as described in the SOLiD System library preparation 
protocol. The reaction was stopped with buffer PB and 
purified by QIAquick PCR purification kit. 

BAC DNA isolation and enrichment of targets 
by BAC hybridization 

BAC 123s has a 250.3-kb insert containing the whole- 
coding region of the CFTR gene, also 40. 1 kb of DNA 5' 
and 25 kb 3' to the gene (13). BAC DNA was isolated 
using the PureLink HiPure Midi Kit (Life Technologies) 
according to the manufacturer's protocol and verified by 
digestion with NotI and Nrul followed by pulsed-field gel 
electrophoresis (13). Enrichment of SOLiD nucleosome 
libraries (2ug) was done essentially as described previ- 
ously (22). In summary, ~150ng of BAC DNA was 
labelled with biotin-dUTP (Enzo Life Sciences) and 
[a- 32 P] dCTP using a nick translation kit (Roche), accord- 
ing to the manufacturer's protocol at 15°C for 1 h. [a- 32 P] 
was used to indirectly determine the incorporation of 
biotin-dUTP. The biotinylated products were purified 
through P-30 columns (Bio-Rad) and lyophilized. 
Labelled BAC was dissolved in Cot-1 DNA (2ug/ul) 
(Invitrogen). The solution was overlaid with mineral oil, 
denatured by incubation at 95°C for 5min and incubated 
at 65°C for 15min. Five microlitres of 2 x hybridization 
buffer (22) was added and incubated at 65° C for 6h. 

To capture nucleosome DNA, 1-2 ug of adapter-ligated 
genomic DNA in 5 ul of water was overlaid with mineral 
oil, 5 ul of 2 x hybridization buffer was added and then the 
entire sample was transferred to the tube containing the 
Cot-1 blocked BAC DNA. The hybridization reaction was 
incubated at 65°C for a further 72-80 h. In all, 150ul of 
streptavidin-coated beads (Invitrogen) were washed twice 
with 200 ul streptavidin bead-binding buffer (lOmMTris- 
HC1, pH 7.5, 1 mM EDTA, pH 8, and 1 M NaCl) and then 
resuspended in 1 50 ul of the same buffer. Next, the whole 
hybridization reaction was added to the streptavidin bead 
suspension and bound at room temperature for 30 min on 
a rotator. Streptavidin beads were removed from the 
binding buffer and washed once, at 25°C for 15 min in 
lml of 1 x SSC with 0.1% SDS, and then three times, 
each at 65°C for 15min in lml 0.1 x SSC with 0.1% 
SDS. The nucleosome DNA hybridized to the labelled 
BAC DNA was eluted by addition of 100 ul of 
O.lMNaOH at 25°C for lOmin. Eluted DNA solution 
was neutralized by addition of 100 ul of lMTris-HCl 
(pH 7.5) and desalted by using a P-30 column. 



After enrichment, DNA was amplified by PCR (18 
cycles) using PI and P2 primers, and PCR products 
were gel extracted. Enriched paired-end libraries were 
combined in equal molar amount and sequenced on 
SOLiD System 4 (ABI, Carlsbad, CA, USA) instrument 
by paired-end chemistry (35/25 bp) using a quarter of the 
slide. 

Sequence alignments 

After sequencing, the colourspace reads were grouped by 
barcode and aligned to the human reference genome 
(GRCh37/hgl9) using BioScope version 1.3.1 from ABI 
(Carlsbad, CA, USA). After alignment, the BAM files 
were processed, and a text file was generated, which 
contains, for each read, the genomic coordinates, strand, 
sequence, alignment length and the number of locations 
the read has aligned to in the genome. Only the uniquely 
mapped reads were kept for the subsequent analyses. To 
construct the nucleosome occupancy score, we further 
filtered out the reads of length between 137 and 157 bp. 
Let Rj be the number of sequencing reads centred at 
genomic location /'. We defined the centre weighted nu- 
cleosome occupancy score at location / as follows: 

0i = J2 W-/;20), (i) 

|./-/|<60 

where G is a Gaussian density with standard deviation 20. 
This Gaussian weight function covers ± 60 bp of the pro- 
jected nucleosome centre and decays towards the two 
ends. Compared with uniform weight function, the 
Gaussian function is effective in avoiding spikes in the 
reads occupancy curve frequently observed in the linker 
region because of overlap of reads arising from two 
neighbouring nucleosomes, and thus provides a sharper 
boundary of closely positioned adjacent nucleosomes. 

Bioinformatic identification of overrepresented 
transcription factor binding sites in nucleosome 
depleted regions 

The paired-end SOLiD reads that mapped to the CFTR 
locus on Genome Reference Consortium Human Build 37 
(GRCh37) were compared across different cell types by 
using the edgeR Bioconductor package (23). The edgeR 
package was initially designed for examining differential 
expression of replicated count data, but it is equally ap- 
plicable to analysis of high-throughput sequencing of 
nucleosome-depleted regions. First, discrete genomic 
intervals were defined over which to compare the 
number of overlapping sequence reads. These (~25 k) 
intervals consisted of a 100-bp window starting at the be- 
ginning of the CFTR locus and sliding forwards 10 bp at a 
time until reaching the end of the locus (chr7: 1 17 082 000- 
117332130). Next, the number of sequencing reads that 
overlapped (at the midpoint of the sequence) with each of 
these intervals, in each cell type, was computed with a perl 
program. Finally, edgeR was run on the resultant 
sequence counts for each of these intervals, comparing 
each cell type with all other cell types to determine 
which intervals were consistently nucleosome depleted, 
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or were depleted in one cell type (e.g. 16HBE14o-) 
compared with another cell type (e.g. skin fibroblasts). 
The edgeR program output all intervals that were repre- 
sented at a statistically different level between cell types, 
and different /"-value cut-offs were then used to select the 
most significant differences between cell types. Next, these 
regions were analysed with the Clover program for iden- 
tification of functional sequence motifs in DNA sequences 
(24). The CFTR locus was used as the background 
sequence for these analyses. 

Chromatin immunoprecipitation 

ChIP experiments were carried out by standard protocols. 
The following antibodies were used: GR (SantaCruz 
scl003); FOXA1 (Abeam, ab5089) and BAF155 
(scl0756). Rabbit and goat IgG were from Santa Cruz 
(sc-2027 and sc-2028). All primers used for qPCR are 
listed in Supplementary Table SI. 

RESULTS 

Statistics of SOLiD DNA sequence analysis 

Data summarizing the sequence analysis for the six cell 
types (including two biological replicas of the primary 
NHBE cells) are shown in Table 1. Total unique 
sequence tags mapping to the hgl9 human genome build 
ranged from 1412 498 for primary epididymis cells to 
11 411 833 for 16HBE14o- cells, with the range likely re- 
flecting absolute amounts of barcoded, purified 
mononucleosomal DNA in the complex mixture that 
was sequenced. The unique tags within the CFTR BAC 
ranged from 1 040 69 1 to 6 936 596, respectively, in the 
same samples, demonstrating percentage enrichment 
efficiencies of 74-61%. The lowest enrichment efficiency 
was 34% in NHBE2, whereas fibroblasts, NHBE1 and 
Caco2 all showed enrichments of between 52 and 56%. 
Sequence coverage by nucleosome was calculated accord- 
ing to the formula (number of mapped reads) x (147)/ 
(BAC size) (7) and ranged from 612 to 4079, thus confirm- 
ing adequate coverage of nucleosome positions across the 
BAC for each cell type. 

Comparative analysis of nucleosome positions using 
different methodologies 

We previously developed a method of detecting nucleo- 
some positioning by tiled qPCR of purified 
mononucleosomal DNA (17). This method was applied 
to analysis of the CFTR promoter and the intron 11 
(1811 + 0.1 kb) DHS region (where 1811 is the last coding 
base in exon 11), which contains a potent enhancer of 
CFTR promoter activity (15). It was of interest to 
compare the nucleosome positioning data generated by 
this method and the BAC hybridization method described 
here. There was generally good overlap between nucleo- 
some mapping generated by the two methods across the 
2-kb region analysed (Supplementary Figure SI), with the 
exception of the 500-bp region immediately 5' to the tran- 
scription start site. Where we previously noted three well- 
positioned nucleosomes using the qPCR method, there is 



an apparent depletion of nucleosomes in the BAC 
hybridization-generated data. Nucleosome positioning 
data from the ENCODE consortium (Stanford/BYU) 
show a comparable depletion of signal in this area, which 
is likely because of the >70% GC content of this region. 
The coincidence of nucleosome positioning between the 
two methods was high in the core of the intron 1 1 DHS 
enhancer sequence (Supplementary Figure S2). 

Specific loss of nucleosomes at well-characterized binding 
sites for known trans-acting factors 

Enhancer elements 

We previously identified several enhancer elements 
associated with DHS within introns of the CFTR gene 
that interact with the promoter in a cell-type-specific 
manner. In some cases, we also characterized critical tran- 
scription factors that bind to the elements and contribute 
to enhancer function. A weak intestinal-specific enhancer 
in the first intron of the gene at 185 + 10 kb (185 is the last 
coding base in exon 1) (25) was shown to bind hepatocyte 
nuclear factor 1 (HNF1) both in vitro and in vivo (15,26). 
Inspection of the nucleosome positioning data for this 
region reveals specific loss/repositioning of nucleosomes 
within the core region of this enhancer element in Caco2 
intestinal carcinoma cells (Figure 1A). We next analysed 
the precise region (chr7: 117 130 188-117 130 198) of the 
conserved HNF1 binding site that we showed to be 
critical for the activity of the enhancer (26) and found it 
to coincide with the nucleosome depleted region. 
Enrichment of HNF1 binding at this element in Caco2 
cells was confirmed by ChIP (Figure IB). This region 
was also enriched for the enhancer signature protein 
p300 in ChIP experiments (15). 

A stronger cell-type-selective enhancer exists in intron 1 1 
of the CFTR gene at 1811+ 0.8 kb, which is also enriched 
for p300 and was shown by chromosome conformation 
capture (3 C) to interact directly with the CFTR promoter 
(15,27). ChlP-seq data from the ENCODE consortium 
identify binding sites for a number of transcription 
factors, including FOXA1 and C/EBPp in this core of 
this enhancer element (Figure 2A). This is consistent with 
the depletion/relocation of at least two nucleosomes in this 
region in Caco2 cells in which the intron 11 element is 
highly active (1 5). In contrast, well-positioned nucleosomes 
are seen at this site in skin fibroblast and primary airway 
(NHBE) cells in which the element is inactive. Moreover, 
using ChIP with an antibody specific for FOXA1, we show 
no enrichment of this factor in the DHS11 region 
(Figure 2B), despite the expression of FOXA1 protein in 
these cells (Figure 2C). In contrast, there is strong enrich- 
ment of FOXA1 at the DHS 11 enhancer in Caco2 cells 
(Figure 2B), consistent with our recent data showing 
binding of both FOXA1 and FOXA2 (which recognize 
the same DNA sequence motif) to this region (28). 

Insulator elements 

Enhancer-blocking insulator elements are important for 
establishing chromatin domains and transcriptional hubs 
genome wide and are often located at the functional 
boundaries of individual loci. CCCTC binding factor 
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Table 1. Statistics of SOLiD paired-end DNA sequence analysis 



Cell type/barcode Fibroblast NHBE1 NHBE2 16HBE14o- Caco2 Epididymis 

AGCTTA GTCATC GCATGT AAGTAA GTGCCT TAAAGT 



Unique tags within CFTR BAC 


5 635 722 


4 695 178 


1054913 


6 936 596 


3 121914 


1 040 691 


Unique tags at NON-target region 


4491093 


4 282 556 


2 051 985 


4475238 


2832 115 


371 808 


Unique tags in hgl9 


10126815 


8977734 


3 106 898 


11411 833 


5 954 029 


1412 498 


Non-unique tags in hgl9 


639 090 


262 297 


196 299 


823 494 


361968 


76 100 


Total number of tags in hgl9 


10 765 905 


9 502 327 


3 303 197 


12 235 327 


6315997 


1 488 598 


Enrichment efficiency (%) 


56 


52 


34 


61 


52 


74 


Unique pairs in hgl9 (%) 


94 


94 


94 


93 


94 


95 


Coverage per nucleosome 


3314 


2761 
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4079 


1836 


612 
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Figure 1. Nucleosome positions across CFTR intron 1 (185+ 10 kb) DHS identify the location of a critical HNF1 binding site. (A) Sequence reads 
showing nucleosome positions across CFTR intron 1 (185 + 10 kb) DHS (25) in fibroblasts, NHBE, 16HBE14o- and Caco2 cells. The location of the 
DHS is marked above the reads, and digital DNase hypersensitivity data from ENCODE confirm its location below the reads. Asterisk denotes 
depletion of a nucleosome at the DHS core that corresponds to the HNF1 binding site in Caco2 cells. Below the reads, single-base resolution of the 
region marked by asterisk in (A) shows the HNF1 binding site (26). (B) Enrichment of HNF1 assayed by ChIP is greatest at the intron 1 DHS, 
although it is also seen at other elements across the CFTR locus in Caco2 cells. 
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Figure 2. Nucleosome positions across CFTR intron 11 (1811 +0.8 kb) DHS show the FOXA binding site that contributes to enhancer activity of 
this element. (A) Sequence reads showing nucleosome positions across CFTR intron 11 (1811 +0.8kb) DHS (15) in fibroblasts, NHBE, 16HBE14o- 
and Caco2 cells. The location of the DHS is marked above the reads and digital DNase hypersensitivity data and transcription factor ChlP-seq data 
from ENCODE are shown below the reads. Asterisk denotes depletion of nucleosomes at the DHS core that corresponds to the FOXA binding site 
in Caco2 cells. (B) ChIP with an antibody specific for FOXA1 shows no enrichment in NHBE cells where the DHS 11 enhancer is inactive, despite 
the presence of low levels of FOXA1 protein and strong enrichment of DHS 11 and other c/.?-acting regulatory elements in CFTR in Caco2 cells. 
(C) Relative expression of FOXA1 protein in Caco2, 16HBE14o- and NHBE cells. Western blot of whole-cell lysates probed with the same antibody 
used in ChIP experiments or anti P-tubulin and ImageJ quantitation of a scan of the same blot, relative to the p-tubulin signal. 
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(CTCF) is frequently associated with these insulator 
elements where it interacts with cohesin complex compo- 
nents (29,30). We previously described CTCF-binding, 
enhancer-blocking insulator elements associated with 
DHS at -20.9 kb with respect to the CFTR translational 
start site and at a DHS at +6.8-kb downstream from the 
last coding base (31-33). The — 20.9-kb DHS is seen in 
most epithelial cell types examined and also in some 
other cell types. In contrast, the +6.8-kb DHS is only 
evident in human lung tissue and epididymis epithelial 
cells, although low levels of CTCF binding are also seen 
in Caco2 cells. These observations are markedly consistent 



with the nucleosome positioning data at these sites (Figure 
3 A and B). The CTCF binding site at the core of the 
—20.9-kb DHS is flanked by two well-positioned nucleo- 
somes, with the 5'-peak being more evident in all cases 
than the 3'-peak (Figure 3A). Moreover, these flanking 
nucleosomes seem to have been displaced to increase the 
nucleosome-free region coinciding with the CTCF binding 
site. The usual periodicity of ~ 1 50 bp between nucleosome 
peaks is increased to 270 bp for nucleosomes flanking the 
CTCF binding site. At the +6.8-kb insulator element, the 
nucleosomes flanking the CTCF binding site are also well 
positioned, likely with a loss/repositioning of at least one 
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nucleosome, as the distance between the peaks is ~300 bp 
(Figure 3B). Also of note is the difference in profile of the 
sequence peaks between the flanking nucleosomes in dif- 
ferent cell types. ChIP data for CTCF binding in the DHS 
+6.8-kb region showed 15-fold enrichment in primary epi- 
didymis cells in which the DHS is evident in comparison 
with 6-fold in Caco2 cells that lack the DHS (33). 
Consistent with this observation, complete loss of nucleo- 
somes between the flanking peaks is seen in epididymal 
cells, whereas there is low occupancy of this region in 
Caco2 cells (Figure 3B). The +6.8-kb DHS is also 
evident in adult human lung tissue (34); therefore, it is 
of interest that one culture of primary human airway epi- 
thelial cells (NHBE2) shows a profile similar to the 
primary epididymis, whereas the other (NHBE1) is more 
similar to Caco2. 

Other elements 

We examined nucleosome positioning across multiple 
other cell-type-specific DHS in the CFTR locus. One 
such element is associated with a DHS at 21.5 kb with 
respect to last coding base of the gene and is only 
evident in primary airway epithelial cells (15,16). Again, 
specific depletion of nucleosomes is seen in these cell types; 
however, in this case, at least three nucleosomes are 
affected, two lie adjacent to each other, whereas the 
third is separated from them by two well-positioned nu- 
cleosomes (Supplementary Figure S3). The underlying 
trans-acting factors are currently being investigated. 

Identification of predicted binding sites for novel factors 
that bind in nucleosome depleted regions in vivo 

We also evaluated the nucleosome positioning at a 
cell-type-specific DHS at — 35 kb with respect to the 
CFTR translational start site that is particularly evident 
in 16HBE14o- cells and primary epididymis cells and is 
associated with enhancer activity in airway epithelial cells 
(16). To date, we have not identified the critical transcrip- 
tion factors that interact with this region. However, the 
availability of nucleosome positioning data across this 
region identified BAF155 (a component of the SWI/ 
SNF-related, matrix associated, actin-dependent regulator 
of chromatin structure, SMARC) as a strong candidate 
for interaction with this element. Selective depletion of 
at least three nucleosomes is seen in 16HBE14o- cells at 
the core of the — 35-kb DHS (Figure 4A), a region that 
corresponds to ChlP-seq peaks for C/EBPp" (weak enrich- 
ment) and BAF155 in ENCODE data. We were unable 
to confirm C/EBPp binding to this site by ChIP in 
16HBE14o- cells, although this transcription factor is ex- 
pressed at a high level in these cells (microarray expression 
data not shown). However, the predicted BAF155 occu- 
pancy at the —35-kb DHS was confirmed by ChIP in 
16HBE14o-cells (arrowed in Figure 4B). BAF155 
protein is expressed in 16HBE14o- and Caco2 cells but 
not in fibroblasts (Figure 4C). Consistent with these 
data, a low level of BAF155 enrichment is seen at 
multiple DHS across the locus by ChIP in Caco2 cells 
but not in fibroblasts (Figure 4B). Moreover, inspection 
of the nucleosome positioning profile at the core of the 



—35-kb DHS in Caco2 cells reveals some displacement of 
nucleosomes (marked by arrows in Figure 4A) that may 
correspond to the activity of BAF155 at this site. 

Identification of transcription factor binding sites that are 
overrepresented in regions where a specific nucleosome is 
lost in a cell-specific manner 

Next, we investigated whether we could identify transcrip- 
tion factors de novo that regulated CFTR expression by 
binding to nucleosome-depleted regions, either by them- 
selves displacing nucleosome(s) or by occupying regions 
that were made accessible by the interaction of other 
factors. To achieve this, nucleosome occupancy across 
the CFTR locus in skin fibroblasts (where the gene is 
inactive and shows consistent nucleosome positioning) 
was compared with that of 16HBE14o- cells (where the 
gene is highly expressed) to identify nucleosome-depleted 
regions that were restricted to the latter cell type. The 
sequence of these regions was then evaluated using the 
CLOVER algorithm to look for overrepresented tran- 
scription factor binding sites (Supplementary Table S2), 
using the whole CFTR BAC as a background sequence for 
comparison. This analysis revealed half-sites for multiple 
nuclear hormone receptors including the glucocorticoid 
receptor (GR) and for hepatocyte nuclear factor 4 
(HNF4). To determine whether these sites were of func- 
tional relevance in CFTR expression, we treated 
16HBE14o- cells with dexamethasone (100 nM) for 3h 
and then performed ChIP with an antibody specific for 
GR. As the GR can bind to chromatin as a homodimer 
or in combination with other hormone receptors, changes 
in GR occupancy at predicted nucleosome-free GRE 
homodimer and GRE half binding sites in CFTR were 
then analysed (Figure 5, GRE half-sites and GRE 
homodimers sites in nucleosome-depleted regions are 
shown above the ChIP graph, and the motifs used for 
these predictions are shown in Supplementary Table S3). 
Multiple sites showed enrichment of GR after dexametha- 
sone induction, including in intron 1 chr7: 117 143 050- 
117143 219, intron 10 at chr7: 117 207 782-117 208 258 
and chr7: 117 207 992-117 208 126, intron 21 at chr7: 
117 297 508-117 297 802 and 3' to the gene at +16 kb (at 
chr7: 117 323 222-117 323 391) and +24kb (at chr7: 
117 331607-117 331 776) with respect to the end of the 
coding sequence (Figure 5). In contrast, no changes in 
GR occupancy were seen at any site after dexamethasone 
induction of skin fibroblasts. Moreover, using a RT- 
qPCR assay for CFTR mRNA, dexamethasone was 
shown to decrease CFTR expression under the same ex- 
perimental conditions (Figure 5 inset panel), suggesting 
the observed GR occupancy changes were functionally 
relevant. 



DISCUSSION 

The dynamic competition between nucleosome position- 
ing on the chromatin fibre and the ability of transcription 
factors to bind their recognition sequences and modulate 
gene expression patterns is complex (6,7). Although some 
proteins, such as pioneer factors, have the capability to 



Nucleic Acids Research, 2013, Vol. 41, No. 5 2865 



Scale 
chr7 : 



117084500 I 
-35kbDHS h 



117085500 I 



117086500 I 




Digital DNasel Hypersensitivity Clusters from ENCODE 

Transcription Factor ChlP-seq from ENCODE 
CEBPB 



B 



BAF155 



CFTR ?3 8 10 11_ 18 _ 20 _ 2A 



ii.n 



AJL 



i i. 



4 



JL 



16HBE14o- 



Caco2 



■ I .. i 



Fibroblast 



i ■ ti i 



kDa 
140 _ 
50 — 



p-tubulin 



-50kb 0 +50kb +100kb +150kb +200kb 

Position Relative to CFTR Translational Start Point (10 A 4) 

Figure 4. Nucleosome depletion at the — 35-kb DHS in 16HBE14o- cells identifies a role for BAF155 in CFTR regulation in airway epithelial cells. 
(A) Sequence reads showing nucleosome positions across CFTR —35-kb DHS (15,16) in fibroblasts, NHBE, 16HBE14o- and Caco2 cells. The 
location of the DHS is marked above the reads, and digital DNase hypersensitivity data and transcription factor ChlP-seq from ENCODE are 
shown below the reads. Asterisk denotes depletion of several nucleosomes at the DHS core that correspond to the BAF155 binding site in 
16HBE14o- cells. (B) ChIP with an antibody specific for BAF155 shows enrichment of — 35 kb DHS (arrow) and other cir-acting regulatory 
elements in CFTR in 16HBE14o- and Caco2 cells but not in fibroblasts. (C) Relative expression of BAF155 protein in Caco2, 16HBE14o-, 
NHBE cells and fibroblasts. Western blot of whole-cell lysates probed with the same antibody used in ChIP experiments or anti (3-tubulin and 
ImageJ quantitation of a scan of the same blot, relative to the (3-tubulin signal. 



interact with and disrupt compacted heterochromatin by 
recruiting chromatin-remodelling complexes (35), others 
can only bind to open chromatin. Moreover, all transcrip- 
tion factors must overcome the energetically favourable 
nucleosome core particle by destabilizing its interaction 
with the DNA strand (8,9). Despite the power of 
multiple search engines to predict transcription factor 
binding sites in silico based on DNA sequence alone, 
these bioinformatic methods have their limitations. 
Hence, we examined the potential of using experimentally 
determined nucleosome positioning information across a 



250-kb locus to identify cell-type-specific transcription 
factor binding sites. Applying a novel method to map nu- 
cleosomes in primary human epithelial cells and epithelial 
cell lines, we generated precise nucleosome occupancy 
data across the CFTR locus. Purified mononucleosomal 
DNA from each cell type was hybridized to a 250-kb 
BAC encompassing the locus, purified, amplified and 
sequenced. Sequences were aligned with hgl9, and the nu- 
cleosome positions were then compared between the 
diverse cell types, enabling specific differences in nucleo- 
some occupancy to be correlated with transcription factor 
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interactions. Moreover, bioinformatics approaches to 
reveal TF binding motifs that were overrepresented in nu- 
cleosome-depleted regions identified the involvement of 
the GR in CFTR regulation, a mechanism that was previ- 
ously unknown. 

The application of this method for nucleosome pos- 
itioning for defined regions of the human genome has 
some advantages over the ENCODE genome-wide data 
that are available on the UCSC genome browser (http:// 
genome.ucsc.edu). Specifically the ability to map nucleo- 
somes in chromatin derived from primary cell types for 
which limited material is available and the ability to 
generate greater sequence coverage per nucleosome. 
Moreover, by combining multiple BACs, it should be 
possible to generate a high-resolution map of nucleosome 
positioning over megabase regions of genomic DNA in 
addition to the 250-300 kb regions cloned in single 
BACs. Two potential technical limitations of the method 
should be noted. First, although the use of MNase I to 
map nucleosome positions is generally well accepted, there 
is some evidence (36) that MNase I has a 



sequence-dependent bias. In the context of our experi- 
ments, this is not significant, as we are comparing nucleo- 
some occupancy/depletion in different cell types which all 
have essentially the same genomic sequence. Moreover, 
recent data showed that chemical mapping produces 
nucleosome-positioiiing data that are similar to those 
derived from MNase I (37). Second, the deep sequencing 
approach to nucleosome positioning may reveal appar- 
ently depleted regions that are artifacts of high GC 
content. An example is the presence of a gap of ~500 bp 
immediately 5' to the CFTR transcription start site when 
compared with our previous PCR tiling approach to map 
nucleosomes. However, this gap is also present in the nu- 
cleosome positioning data on the ENCODE browser and 
corresponds to a region of high GC content, which are 
difficult to sequence. The high GC content could also 
potentially influence the kinetics of BAC hybridization. 

The identification of novel factors involved in regula- 
tion of CFTR expression by analysing cell-type-selective 
nucleosome-free regions warrants further discussion. 
First, we show enrichment of BAF155 in vivo at the 
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enhancer element within the — 35 kb DHS. BAF155 is a 
component of the SWI/SNF chromatin-remodelling 
complex, and it is likely recruited to this element by 
other transcription factors. Our ChIP data demonstrate 
enrichment of BAF155 at several DHS across the CFTR 
locus in 16HBE14o- cells and differential relative occu- 
pancy in Caco2 cells, as would be predicted from the 
divergent regulatory mechanisms in these cell types (16). 
Moreover, no BAF155 enrichment is seen at the inactive 
CFTR locus in fibroblasts. 

Next, identification of the GR as a suppressor of CFTR 
expression in 16HBE14o- cells, based on the data 
showing overrepresentation of its binding motif in 
nucleosome-depleted regions in this cell type, is novel. 
Hormonal regulation of CFTR expression has not been 
studied extensively, although there is some evidence for 
its upregulation by oestrogen (38). With the exception of 
an abstract reporting repression of CFTR expression by 
dexamethasone in rat alveolar epithelial cells (39), the 
effects of dexamethasone on CFTR gene expression have 
not been investigated. However, downregulation of gene 
expression by GR is not uncommon [reviewed in (40)]. 
Data generated by the Myers Laboratory at the Hudson 
Alpha Institute for Biotechnology and available on the 
ENCODE browser show GR enrichment by ChIP after 
dexamethasone stimulation of another airway cell line 
A549 (41). This cell line does not express CFTR, and it 
is of interest that although two of the A549 ChlP-seq 
peaks for GR (in introns 10 and 21) coincide with sites 
that were also occupied in 16HBE14o- cells after activa- 
tion by ligand, only one (in intron 21) corresponds to a 
nucleosome-depleted heterodimer binding site for the 
receptor. The possibility that ligand activation of the 
GR downregulates CFTR expression more generally is 
of potential therapeutic relevance. For example, blocking 
ligand:receptor interactions in this well-studied pathway 
could increase CFTR expression in patients with muta- 
tions causing reduced transcript levels. 
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